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Satisfiability is a classic problem in computational complexity theory, in which one wishes to determine 
whether an assignment of values to a collection of Boolean variables exists in which all of a collection of 
clauses composed of logical OR's of these variables is true. Here, a renormalization group transformation is 
constructed and used to relate the properties of satisfiability problems with different numbers of variables in each 
clause. The transformation yields new insight into phase transitions delineating "hard" and "easy" satisfiability 
problems. 
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Computational complexity theory addresses the 
question of how fast the resources required to solve a 
given problem grow with the size of the input needed 
to specify the problem. Q| P is the class of problems 
that can be solved in polynomial time, which means a 
time that grows as a polynomial of the size of the prob- 
lem specification, while NP is the class of problems 
for which a solution can be verified in polynomial time. 
Whether or not P is distinct from NP has been a cen- 
tral unanswered question in computational complexity 
theory for decades. 0] 

Satisfiability (SAT) is a classic problem in compu- 
tational complexity. An often-studied type of SAT is 
K-SAlT, in which one attempts to find assignment of 
N variables such that the conjunction (AND) of M 
constraints, or clauses, each of which is the disjunction 
(OR) of K literals, each literal being either a negated 
or un-negated variable, is true. (This way of writing the 
problem, as a conjunction of clauses that are disjunc- 
tions, is called conjunctive normal form.) For example, 
the 3-SAT instance with the 4 variables x\, x 2 , x§, and 
xa and the four clauses 
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is satisfiable because it it is true for the assignments 
x\ = \,x 2 = 1, £3 = 1, X4 = 1. Below, we will write 
satisfiability problems in conjunctive normal form us- 
ing the notation of |3], where the ANDs and ORs are 
implied and the literals have positive or negative signs 
depending on whether or not they are negated. For ex- 
ample, the expression of Eq. Q is written 



(1 -2 4), (-1 3 -4), (2 3 4), (-1 -3 4) . 

2-SAT can be solved in polynomial time 01 , while 
K-SXT with K > 3 is known to be NP-complete |4]: 
if a polynomial algorithm for 3-SAT exists, then P is 
equal to NP. The complexity of SAT is intimately re- 
lated to the presence of phase transitions H 0, 0, HI El 
Eel EH. For random problems with N variables, M 
clauses, and K literals per clause, as M is increased 
there is a phase transition from a satisfiable phase, in 
which almost all random instances are satisfiable, to 
an unsatisfiable phase, in which almost all random in- 
stances are unsatisfiable. The most difficult instances 
are near this SAT-unSAT transition. It has also been 
shown that there is a transition as the parameter K is 
changed between 2 and 3, at K c ~ 2.4, at which the 
nature of the SAT-unSAT transition changes |5]. 

Here, we investigate the relationship between satis- 
fiability problems with different values of K by con- 
structing a renormalization group transformation, sim- 
ilar to those used for phase transition problems 1 12, 

El 

E1E1 

, that reduces the number of degrees of freedom, 
while possibly increasing the number and range of in- 
teractions 1 16] (which in this context is the number of 
literals per clause). To do this, we note that the expres- 
sion 
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is satisfiable. Here, the Ai's and Bj's are arbitrary 
clauses and x is a variable. (The easiest way to see the 
equivalence is to note that both expressions are satisfi- 
able if and only (Ai AND A 2 AND . . . A P ) OR (Bi 
AND B 2 AND . . . Bq) is.) The first step of the renor- 
malization procedure is to use this identity to elimi- 
nate a given variable. In this step, P clauses in which 
a given variable comes in un-negated and Q clauses in 
which the same variable comes in negated are elimi- 
nated and replaced with PQ "resolution" 11711 clauses. 
Thus, eliminating a "frustrated" 11811 variable (one that 
enters into different clauses negated and un-negated) 
increases the number of clauses if PQ-(P+Q)>0. The 
resolution of two clauses of length Ki and Kj has 
length Ki + Kj — 2. Note that resolving two 2-clauses 
yields a 2-clause, resolving a 2-clause with a clause of 
length JC > 3 yields a clause length JC, and resolving 
two clauses of with lengths JC\ > 3 and JC 2 > 3 yields 
a clause with length greater than both /Ci and JC 2 . 

One then simplifies the resulting satisfiability ex- 
pression by noting that 

1 . Duplicate clauses are redundant, 

2. Duplicate literals in a given clause are redundant, 

3. If a variable enters into one clause both negated 
and un-negated, then the clause must be true and 
can be removed, 

4. If a clause has one literal, then the value of the 
corresponding variable is determined, and 

5. If a subset of the literals in a clause comprise a 
different clause, then the clause with more liter- 
als is redundant. 

This last point means, for example, that if an expres- 
sion contains both (1 3-4 5) and (1 3), then (13-4 5) 
can be removed, because it is satisfied automatically if 
(1 3) is satisfied. 

This procedure is known in computer science as "the 
Davis-Putnam procedure of 1960 1 19] with subsump- 
tion HjJ," and was originally proposed as a method for 
solving satisfiability instances. It does not perform well 
in practice 12 IB . and has been prov en to require expo- 
nential time on some instances 1 22, 23]. However, here 
the aim is not to solve a given instance, but rather to 
investigate the "flow" of the problem itself as variables 
are eliminated 013. In particular, this renormal- 
ization group (RG) transformation provides a natural 



framework for understanding a phase transitions be- 
tween "easy" and "hard" satisfiability problems iden- 
tified in H. 

We present evidence that the change in the nature of 
the SAT-unSAT phase transition critical value K c ~ 
2.4 is intimately related to whether or not the num- 
ber of clauses proliferates exponentially upon repeated 
application of the renormalization group (RG) trans- 
formation. Note that when K = 2 the clause length 
decreases upon renormalization, since the resolution of 
two 2-clauses is a 2-clause, so no clause gets longer, 
and some of the resulting clauses have a duplicate lit- 
eral and so get shorter. Having a large number of 2- 
clauses limits the growth in the number of long clauses 
because of subsumption, so there is a qualitative dif- 
ference in the behavior depending on whether the ratio 
of the number of 2-clauses to the number of variables 
grows or shrinks upon renormalization. 

We show numerical data for an RG implementa- 
tion in which successive variables are chosen randomly 
and eliminated if they occur in a clause of minimum 
length. This procedure is used because it focuses on 
short clauses, which are much more restrictive than 
long clauses. Figure ^ shows or-, the ratio of Mjc, 
the number of clauses of length K to N, the number 
of variables remaining in the problem, as a function 
of K, as the RG proceeds. The average and standard 
deviation of numerical data from 5 realizations at the 
SAT-unSAT transition with p = 0.2 and p — 0.6 are 
shown (using parameter values for the transition loca- 
tions from 1 5]). Large numbers of long clauses are gen- 
erated whenp = 0.6 > p c and not whenp = 0.2 < p c . 

It has been proven that the SAT-unSAT transition for 
2-SAT occurs at a = 1 Q, and for 2 < K < 2.4, the 
SAT-unSAT transition is believed to occur when a 2 — 
1 H. Figure |2 (left) shows that when K = 2.2, a 2 
increases upon renormalization when a 2 > 1 and de- 
creases upon renormalization when a 2 < 1. Because 
adding additional three-clauses does not affect the be- 
havior of the two-clauses, and because two-clauses are 
much more restrictive than longer clauses, the two- 
clauses dominate the problem whenever a 2 > 1. One 
would expect that 3-clauses by themselves would pro- 
liferate when N i n iti a i, the initial number of variables, 
was equal to 3Af 3 /2. (This estimate, the analog of the 
result for two-clauses, follows from setting the number 
of literals in all 3-clauses to twice the number of vari- 
ables, which means that on average each variable enters 
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FIG. 1: Plot of a.K, the ratio of Mr, the number of clauses of 
length K, to N, the number of variables, at different stages 
of the renormalization process. The points plotted are the 
mean and standard deviation of the results from five indepen- 
dent system realizations. The parameters are chosen to be at 
the SAT-unSAT transition with p = 0.2 < p c (left panel) 
and p — 0.6 > p c (right panel). When p > p c the clause 
length increases markedly and the number of clauses grows 
enormously. 



into one 3-clause negated and one 3-clause un-negated. 
Moreover, eliminating one variable on average yields 
two less literals and one less variable, so that, ignoring 
fluctuations, the relationship remains true.) However, 
because the 2-clauses prevent the 3-clauses from prolif- 
erating, adding 3-clauses to the 2-clauses changes the 
nature of the SAT-unSAT transition only when enough 
3-clauses have been added so that the SAT-unSAT tran- 
sition occurs with «2 < 1. In this regime, under renor- 
malization the 2-clauses disappear and so the clauses 
all become longer. Since very long clauses are ORs of 
many literals and hence easy to satisfy, the number of 
clauses must go up sufficiently fast for the problem to 
be difficult to solve — at large K, the SAT-unSAT tran- 
sition occurs when the ratio of the number of clauses to 
the number of variables is oc 2 K O- WhenX > K c , 
near the SAT-unSAT transition we expect the number 
of clauses to grow geometrically with iteration number, 
and the numerical data are consistent with the maxi- 
mum number of clauses obtained during the renormal- 
ization process increasing exponentially with the initial 
number of variables, with an exponent that increases 
monotonically with {M/N) initiaU the initial ratio of 
the number of clauses to the number of variables. Fig- 
ure 2 (right) shows a phase boundary lines for the SAT- 
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FIG. 2: Left: plot of oli, the ratio of the number of 2-clauses 
to the number of variables, versus N, the number of undeci- 
mated variables, for instances with p = 0.2 and different ini- 
tial values of a. The numerical data are averages and standard 
deviations of five realizations of systems of size 500 when the 
initial a = 1, size 400 when the initial a = a c — 1.2, 
and size 300 when the initial a — 1.67. The numerical 
data are consistent with the hypothesis that when K < K c 
the SAT-unSAT transition occurs when 02 neither decreases 
nor increases upon renormalization. Right: Schematic phase 
diagram showing the SAT-unSAT transition (using data of 
Ref. |31), the region in which the number of 2-clauses in- 
creases upon renormalization (the red hatched region in the 
left of the figure) and an estimate of the region in which the 
number of clauses with K > 3 increases upon renormaliza- 
tion (the blue hatched region in the right of the figure). The 
SAT-unSAT transition line crosses into the region in which 
long clauses proliferate exponentially at the intersection of 
the three lines. The estimate for 3-clause proliferation given 
in the text yields an intersection at (K — 2.4, a = 1.25). 



unSAT transition, for the onset of increase in the num- 
ber of 2-clauses, and our estimate for the onset of pro- 
liferation of clauses of length greater than or equal to 
three. 

When K > K c , the SAT-unSAT transition HH0, 
HUES occurs in a regime in which the renormal- 
ization transformation causes both the typical clause 
length and the total number of clauses to grow. We 
conjecture that the SAT-UNSAT transition occurs at 
the value of M/N at which the rate of exponential 
growth is a critical value. A "replica-symmetry break- 
ing" transition at a somewhat smaller 
value of a can be interpreted in terms of propagation of 
constraints on eliminated literals, as will be discussed 
elsewhere. 1 27] Because of the exponential clause pro- 
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liferation, numerical investigation of these transitions 
using this renormalization group is limited to small 
sizes. However, the renormalization group may still 
be useful for investigating these transitions using an- 
alytic techniques appropriate for large K IKI Hit for 
any K > K c , though it will be necessary to understand 
how to account for possible RG-induced correlations 
between clauses. 

When K > K c , the number of clauses continues to 
increase under renormalization until it is no longer un- 
likely that a given compound clause contains a repeated 
variable (in addition to the decimated one), which we 
expect to occur when the renormalized clause length is 
of order y/~N j28|- Because it appears that there is no 
impediment to the growth in the effective value of K 
until it is of order y/N, where the problem specifica- 
tion itself is exponentially large in N, it appears that 
the renormalization group procedure can transform the 
problem out of NP and even PSPACE altogether. This 
property may indicate that whether or not a given com- 
putational problem has a solution that can be verified 
using polynomially-bounded resources has no funda- 
mental effect on the difficulty of solving the problem. 

In summary, a transformation inspired by the renor- 
malization group is constructed and used to relate the 
behavior of satisfiability problems with different val- 
ues of K, the number of literals per clause. The trans- 
formation provides useful insight into previously iden- 
tified phase transitions of satisfiability problems and 
may yield new insight into the question of whether or 
not P is equal to NP. 
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